FS-XCS vs. GRD-XCS: An analysis using high-dimensional DNA microarray gene expression data sets
Authors
Abstract
XCS, a genetics-based machine learning model that combines reinforcement learning with evolutionary algorithms to evolve a population of classifiers in the form of condition-action rules, has been used successfully for many classification tasks. However, like many other machine learning algorithms, XCS becomes less effective when applied to high-dimensional data sets. In this paper, we analyze two XCS extensions, FS-XCS and GRD-XCS, that attempt to overcome this dimensionality issue. FS-XCS is a standard combination of a feature selection method and XCS. GRD-XCS instead uses feature quality information to bias the evolutionary operators without removing any features from the data sets. Comprehensive numerical simulation experiments show that both approaches can effectively enhance the learning performance of XCS. GRD-XCS obtained significantly more accurate classification results than FS-XCS, whereas FS-XCS ran much faster.
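The contrast between the two strategies can be illustrated with a minimal sketch. It is not the authors' implementation: the scores are made up, and the function names (`top_k_features`, `specify_probabilities`, `guided_condition`), the linear score-to-probability mapping, and the `low`/`high` bounds are all hypothetical choices made for illustration. The first function mimics a filter-style selection that discards features; the other two keep every feature but use the scores to bias how often a feature is specified in a rule condition (`#` denotes "don't care").

```python
import random


def top_k_features(scores, k):
    """Filter-style selection: indices of the k highest-scoring features."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])


def specify_probabilities(scores, low=0.05, high=0.95):
    """Map scores linearly onto [low, high]; no feature is removed.

    The linear mapping and the bounds are assumptions for this sketch.
    """
    lo, hi = min(scores), max(scores)
    if hi == lo:
        return [(low + high) / 2] * len(scores)
    return [low + (high - low) * (s - lo) / (hi - lo) for s in scores]


def guided_condition(instance, probs, rng):
    """Build a rule condition from a data instance.

    Feature i is specified (copied from the instance) with probability
    probs[i]; otherwise it becomes the don't-care symbol '#'.
    """
    return [value if rng.random() < p else "#"
            for value, p in zip(instance, probs)]


# Made-up feature-quality scores for five features.
scores = [0.9, 0.1, 0.5, 0.05, 0.7]
instance = [1, 0, 1, 1, 0]

selected = top_k_features(scores, k=2)        # features kept by the filter
probs = specify_probabilities(scores)         # bias used instead of removal
condition = guided_condition(instance, probs, random.Random(0))
```

In the first case the learner only ever sees the features in `selected`; in the second it sees all five, but low-scoring features are rarely specified in a condition.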
Similar resources
Incorporating feature ranking and evolutionary methods for the classification of high-dimensional DNA microarray gene expression data.
BACKGROUND: DNA microarray gene expression classification poses a challenging task to the machine learning domain. Typically, the dimensionality of gene expression data sets ranges from several thousand to over 10,000 genes. A potential solution to this issue is using feature selection to reduce the dimensionality. AIMS: The aim of this paper is to investigate how we can use feature quality ...
Guided Rule Discovery in XCS for High-Dimensional Classification Problems
XCS is a learning classifier system that combines a reinforcement learning scheme with evolutionary algorithms to evolve a population of classifiers in the form of condition-action rules. In this paper, we investigate the effectiveness of XCS in high-dimensional classification problems where the number of features greatly exceeds the number of data instances – common characteristics of microarr...
Feature Selection and Classification of Microarray Gene Expression Data of Ovarian Carcinoma Patients using Weighted Voting Support Vector Machine
DNA microarray gene expression data give access to a wealth of information spanning thousands of variables (genes). Analysis of this information can reveal the genetic causes of disease and the differences between tumors. In this study we try to reduce high-dimensional data with a statistical method to select valuable, high-impact genes as biomarkers, and then classify ovarian tumors based on gene expression data of...
An enhanced XCS rule discovery module using feature ranking
XCS is a genetics-based machine learning model that combines reinforcement learning with evolutionary algorithms to evolve a population of classifiers in the form of condition-action rules. Like many other machine learning algorithms, XCS is less effective on high-dimensional data sets. In this paper, we describe a new guided rule discovery mechanism for XCS, inspired by feature selection techn...
CoXCS: A Coevolutionary Learning Classifier Based on Feature Space Partitioning
Learning classifier systems (LCSs) are a machine learning technique that combines reinforcement learning and evolutionary algorithms to evolve a set of classifiers (or rules) for pattern classification tasks. Despite promising performance across a variety of data sets, the performance of LCSs is often degraded when data sets of high dimensionality and relatively few instances are encountered, a...